I. INTRODUCTION


The outcome statistics of a general quantum measurement process are described
by a positive-operator valued measure (POVM). Such a description of quantum measurements
enables us to formulate both projective and non-projective measurements. However,
as the price of such generality, some POVMs contain redundant information irrelevant to
the system and, due to such redundancies, for a given POVM there exist infinitely many
POVMs that give equivalent information about the system.

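To clarify this point, let us consider the following example of a pair of discrete POVMs
$A$ and $B$:

$$A = \{A_0, A_1\}, \qquad O \leq A_0 \leq I, \qquad A_1 = I - A_0,$$

$$B = \{B_{00}, B_{01}, B_{10}, B_{11}\},$$

$$B_{00} = \lambda A_0, \quad B_{01} = (1-\lambda) A_0, \quad B_{10} = \lambda A_1, \quad B_{11} = (1-\lambda) A_1.$$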

Here $\lambda \in (0,1)$, and $A_i$ and $B_{ij}$ are bounded operators on a Hilbert space $\mathcal{H}$ corresponding
to the measured quantum system, and $O$ and $I$ are the zero and identity operators, respectively.
The measurement corresponding to the POVM $B$ can be realized, for example, as follows:
perform the measurement $A$, whose measurement outcome is $i \in \{0,1\}$, and generate a
binary random variable $j$, which gives 0 with probability $\lambda$ and 1 with probability $1 - \lambda$;
the measurement outcome of $B$ is then given by the pair $(i, j)$. Evidently $B = \{B_{ij}\}$ contains
redundant information, which in this case is the classical random variable $j$, and $A$ and $B$
give equivalent information about the system.
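The example can be checked numerically. The following is a minimal sketch, assuming a qubit system; the particular effect `A0`, the value of `lam` and the state `rho` are hypothetical choices made only for illustration. It builds $B_{ij}$ from $A_i$, and confirms that the outcome distribution of $B$ factorizes into the distribution of $A$ times the state-independent distribution of $j$.

```python
import numpy as np

lam = 0.3  # mixing probability (lambda in the text); hypothetical value
A0 = np.array([[0.7, 0.1], [0.1, 0.2]])  # hypothetical effect with O <= A0 <= I
A = [A0, np.eye(2) - A0]

# B_ij = P(j) * A_i, with P(j=0) = lam and P(j=1) = 1 - lam
B = {(i, j): w * A[i] for i in range(2) for j, w in enumerate((lam, 1 - lam))}

rho = np.array([[0.6, 0.2], [0.2, 0.4]])  # hypothetical state (unit trace, positive)

def prob(state, effect):
    """Born rule: probability tr[state * effect]."""
    return float(np.trace(state @ effect).real)

p_A = [prob(rho, a) for a in A]
p_B = {k: prob(rho, b) for k, b in B.items()}

# Discarding j (summing over it) recovers the statistics of A.
marginal_i = [p_B[(i, 0)] + p_B[(i, 1)] for i in range(2)]
```

Because `p_B[(i, j)]` equals `p_A[i]` times a weight that does not depend on `rho`, the variable $j$ carries no information about the state.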

Then it is natural to ask whether we can reduce such redundancies for a given POVM and 
how far such reductions proceed. In this paper, to formulate and answer this question, we 
introduce a concept of a minimal sufficient POVM which corresponds to the least redundant 
POVM among the POVMs that give the same information about the system. The main 
finding of this paper (Theorem 5) is that for any POVM on a separable Hilbert space there
exists a minimal sufficient POVM that has the equivalent information about the system 
and that such a minimal sufficient POVM is unique up to almost isomorphism, which is the 
relabeling neglecting null sets. 

The concept of the minimal sufficient POVM has two origins: the minimal sufficient statistic
in mathematical statistics and the fuzzy equivalence relation known in quantum
measurement theory. Two POVMs are fuzzy equivalent if each of them can be realized by a
classical post-processing of the other. The concept of the fuzzy equivalence relation is used
in the definition of the minimal sufficient POVM. The minimal sufficiency condition for the
POVM can be regarded as a generalization of the minimal sufficient statistic in the sense
that we consider a more general class of post-processing which includes taking statistics.

This paper is organized as follows. In Sec. II we show some preliminary results on
mathematical statistics and quantum measurement theory which will be used in Sec. III. In
Sec. III we give two definitions of the minimal sufficient POVM corresponding to two kinds
of fuzzy equivalence relations and introduce a sufficient statistic called a Lehmann-Scheffé-Bahadur
(LSB) statistic for a given POVM. It is shown in Theorem 4 that the LSB statistic
is a minimal sufficient statistic for the given POVM and that the POVM induced by the LSB
statistic is a minimal sufficient POVM. After introducing a concept of almost isomorphism
between POVMs, we show in Theorem 5 that a POVM has an equivalent minimal sufficient
POVM and that such a minimal sufficient POVM is unique up to almost isomorphism. In
Sec. IV we consider discrete POVMs and prove that for a given discrete POVM there is a
unique equivalent minimal sufficient POVM that is discrete and has no zero elements. In
Sec. V we apply the main results to information conservation conditions proposed by the
author.


II. PRELIMINARIES 

In this section, we introduce some preliminary concepts and results on mathematical 
statistics and quantum measurement theory. 

A. Sufficient and minimal sufficient statistics 

Let $(\Omega, \mathcal{B})$ be a measurable space. A family of probability measures $\mathcal{P}$ with the outcome
space $(\Omega, \mathcal{B})$ is called a statistical model on $(\Omega, \mathcal{B})$.

Let $\mathcal{P}$ be a statistical model on $(\Omega, \mathcal{B})$ and let $\lambda$ be a probability measure on $(\Omega, \mathcal{B})$.
$\mathcal{P}$ is said to be dominated by $\lambda$, denoted by $\mathcal{P} \ll \lambda$, if every element $P \in \mathcal{P}$ is absolutely
continuous with respect to $\lambda$. A statistical model $\mathcal{P}$ is said to be dominated if there exists
a probability measure that dominates $\mathcal{P}$. A dominated statistical model $\mathcal{P}$ has a countable
subset $\{P_i\}_{i \geq 1} \subset \mathcal{P}$ such that $\lambda := \sum_{i \geq 1} c_i P_i$ dominates $\mathcal{P}$ whenever $c_i > 0$ and $\sum_{i \geq 1} c_i = 1$ hold.
Such $\lambda$ is called a pivotal measure for $\mathcal{P}$.

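Let $\mathcal{P} = \{P_\theta\}_{\theta \in \Theta}$ be a statistical model on an outcome space $(\Omega, \mathcal{B})$ and let $(\Omega_T, \mathcal{B}_T)$
be a measurable space. A $\mathcal{B}/\mathcal{B}_T$-measurable map $T: \Omega \to \Omega_T$ is called a statistic. The
set of $\mathcal{B}/\mathcal{B}_T$-measurable maps (statistics) is denoted by $\mathcal{M}((\Omega, \mathcal{B}) \to (\Omega_T, \mathcal{B}_T))$. $T$ is
said to be sufficient with respect to the statistical model $\mathcal{P}$ if for every $E \in \mathcal{B}$ there
exists a $\mathcal{B}_T$-measurable function $P(E|\cdot): \Omega_T \to [0, \infty)$ such that $P_\theta(E|t) = P(E|t)$ $P_\theta^T$-a.e.
for each $P_\theta \in \mathcal{P}$, where $P_\theta(E|t)$ is the conditional probability of $P_\theta$ for given $T = t$ and
$P_\theta^T(\cdot) := P_\theta(T^{-1}(\cdot))$.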

Let $P$ and $Q$ be probability measures with an outcome space $(\Omega, \mathcal{B})$. Suppose $f: [0, \infty) \to \mathbb{R}$
is a strictly convex function such that $f(1) = 0$. Taking a $\sigma$-finite measure $\mu$ dominating
$P$ and $Q$, we write the Radon-Nikodym derivatives as $p(x) := dP/d\mu(x)$ and $q(x) := dQ/d\mu(x)$.
An $f$-divergence between $P$ and $Q$ is defined by

$$D_f(P, Q) := \int_\Omega f\!\left(\frac{p(x)}{q(x)}\right) q(x)\, d\mu(x),$$

where the integral on the RHS is independent of the choice of $\mu$ and the following conventions
are adopted:

$$f(0) := \lim_{t \to +0} f(t), \qquad f^*(0) := \lim_{t \to \infty} \frac{f(t)}{t}, \qquad 0 \cdot f\!\left(\frac{0}{0}\right) := 0, \qquad 0 \cdot f\!\left(\frac{a}{0}\right) := a f^*(0) \quad (a > 0).$$

For later use, we fix an $f$ such that $D_f(P, Q) < \infty$ for each $P$ and $Q$. An example of such an
$f$ is given by $f(t) = (\sqrt{t} - 1)^2$, and the corresponding $f$-divergence is the Hellinger distance

$$H(P, Q) = \int_\Omega \left(\sqrt{p(x)} - \sqrt{q(x)}\right)^2 d\mu(x) \leq 2.$$
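For finite outcome spaces the $f$-divergence reduces to a finite sum, which makes the Hellinger example easy to check. The sketch below assumes strictly positive distributions (so that none of the conventions for zero densities is needed); the distributions `p`, `q` and the coarse-graining matrix `K` are hypothetical. It also illustrates the standard fact that an $f$-divergence cannot increase under classical post-processing.

```python
import numpy as np

def f_divergence(p, q, f):
    """D_f(P, Q) = sum_x f(p(x)/q(x)) q(x), assuming q(x) > 0 for all x."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    return float(np.sum(f(p / q) * q))

def hellinger_f(t):
    """f(t) = (sqrt(t) - 1)^2, strictly convex with f(1) = 0."""
    return (np.sqrt(t) - 1.0) ** 2

p = np.array([0.5, 0.3, 0.2])
q = np.array([0.2, 0.3, 0.5])

d_f = f_divergence(p, q, hellinger_f)
d_direct = float(np.sum((np.sqrt(p) - np.sqrt(q)) ** 2))  # Hellinger distance

# Post-processing by a column-stochastic matrix K, K[x, y] = kappa(x|y):
# outcomes 0 and 1 are merged, outcome 2 is kept.
K = np.array([[1.0, 1.0, 0.0],
              [0.0, 0.0, 1.0]])
d_coarse = f_divergence(K @ p, K @ q, hellinger_f)
```

Here `d_f` and `d_direct` agree, both are bounded by 2, and `d_coarse` does not exceed `d_f`.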

The following theorem gives necessary and sufficient conditions for the sufficiency of a 
statistic. 


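Theorem 1. Let $\mathcal{P} = \{P_\theta\}_{\theta \in \Theta}$ be a statistical model on a measurable space $(\Omega, \mathcal{B})$ dominated
by a pivotal probability measure $\lambda$, and let $T: \Omega \to \Omega_T$ be a $\mathcal{B}/\mathcal{B}_T$-measurable statistic and
$P_\theta^T(\cdot) := P_\theta(T^{-1}(\cdot))$. Then the following conditions are equivalent:

(i) $T$ is sufficient;

(ii) for every $P_\theta \in \mathcal{P}$, there exists a $\mathcal{B}_T$-measurable real valued function $g_\theta(\cdot)$ such that

$$\frac{dP_\theta}{d\lambda}(x) = g_\theta(T(x)) \quad \lambda\text{-a.e.};$$

(iii) for every $P_\theta \in \mathcal{P}$,

$$\frac{dP_\theta}{d\lambda}(x) = \frac{dP_\theta^T}{d\lambda^T}(T(x)) \quad \lambda\text{-a.e.},$$

where $\lambda^T(\cdot) := \lambda(T^{-1}(\cdot))$;

(iv) $D_f(P_{\theta_1}, P_{\theta_2}) = D_f(P_{\theta_1}^T, P_{\theta_2}^T)$ $(\forall P_{\theta_1}, \forall P_{\theta_2} \in \mathcal{P})$.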

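Proof. The equivalence (i)$\Leftrightarrow$(ii)$\Leftrightarrow$(iv) is well known. The implication (iii)$\Rightarrow$(ii) is
obvious. Let us show (ii)$\Rightarrow$(iii). For each $F \in \mathcal{B}_T$ we have

$$P_\theta^T(F) = \int_\Omega \chi_F(T(x)) \frac{dP_\theta}{d\lambda}(x)\, d\lambda(x) = \int_\Omega \chi_F(T(x))\, g_\theta(T(x))\, d\lambda(x) = \int_{\Omega_T} \chi_F(t)\, g_\theta(t)\, d\lambda^T(t),$$

where $\chi_F(\cdot)$ is the indicator function for $F$. This implies that $g_\theta(t) = dP_\theta^T/d\lambda^T(t)$ $\lambda^T$-a.e.
and we obtain (iii). □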

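Let $\mathcal{P}$ be a statistical model on $(\Omega, \mathcal{B})$. A statistic $T \in \mathcal{M}((\Omega, \mathcal{B}) \to (\Omega_T, \mathcal{B}_T))$ is said
to be minimal sufficient if $T$ is sufficient and for each sufficient statistic $S \in \mathcal{M}((\Omega, \mathcal{B}) \to (\Omega_S, \mathcal{B}_S))$
there exists a map $f \in \mathcal{M}((\Omega_S, \mathcal{B}_S) \to (\Omega_T, \mathcal{B}_T))$ such that $T(x) = f(S(x))$
$\mathcal{P}$-a.e. A minimal sufficient statistic can be interpreted as capturing the information about
the statistical model $\mathcal{P}$ in the least redundant manner.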

The following theorem gives a sufficient condition for the existence of a minimal sufficient 
statistic. 

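Theorem 2 (Lehmann and Scheffé, Bahadur). Let $\mathcal{P} = \{P_\theta\}_{\theta \in \Theta}$ be a statistical model on
an outcome space $(\Omega, \mathcal{B})$. Suppose that there exists a countable subset $\{P_{\theta_i}\}_{i \geq 1} \subset \mathcal{P}$ dense
in $\mathcal{P}$ with respect to the following metric:

$$d(P, Q) := \sup_{E \in \mathcal{B}} |P(E) - Q(E)|. \tag{1}$$

Then the statistic $T \in \mathcal{M}((\Omega, \mathcal{B}) \to (\mathbb{R}^\infty, \mathcal{B}(\mathbb{R}^\infty)))$ defined by

$$T(x) := \left(\frac{dP_{\theta_i}}{d\lambda}(x)\right)_{i \geq 1} \in \mathbb{R}^\infty$$

is a minimal sufficient statistic. Here $(\mathbb{R}^\infty, \mathcal{B}(\mathbb{R}^\infty))$ is the countable product space of the
real line $(\mathbb{R}, \mathcal{B}(\mathbb{R}))$ and $\lambda := \sum_{i \geq 1} c_i P_{\theta_i}$ with $c_i > 0$ and $\sum_{i \geq 1} c_i = 1$.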
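For a finite family of distributions on a finite outcome set, the family is its own countable dense subset and the statistic of Theorem 2 is a finite vector of likelihood ratios. The sketch below, with a hypothetical three-element model `P` and uniform weights `c`, computes $T(x)$ and groups the outcomes that $T$ does not distinguish.

```python
import numpy as np

# Rows: hypothetical distributions P_theta_i on the outcomes {0, 1, 2, 3}.
P = np.array([[0.10, 0.20, 0.30, 0.40],
              [0.30, 0.60, 0.05, 0.05],
              [0.20, 0.40, 0.20, 0.20]])
c = np.full(len(P), 1.0 / len(P))  # weights c_i > 0 summing to 1
lam = c @ P                        # pivotal measure lambda = sum_i c_i P_theta_i

# T(x) = (dP_theta_i/dlambda(x))_i: column x of T is the value of the statistic at x.
T = P / lam

# Outcomes sharing the same value of T are merged by the statistic.
labels = {}
classes = [labels.setdefault(tuple(np.round(T[:, x], 12)), len(labels))
           for x in range(P.shape[1])]
```

In this model the columns for outcomes 0 and 1 are proportional, so `T` assigns them the same value, while outcomes 2 and 3 remain distinct.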

B. Positive-operator valued measure 

Throughout this paper, we fix a separable (i.e. $\dim \mathcal{H} \leq \aleph_0$) Hilbert space $\mathcal{H}$ and denote
the set of bounded operators on $\mathcal{H}$ by $\mathcal{L}(\mathcal{H})$. A positive trace class operator $\rho$ with unit
trace is called a state on $\mathcal{H}$, and the set of states on $\mathcal{H}$ is denoted by $\mathcal{S}(\mathcal{H})$.

Let $(\Omega, \mathcal{B})$ be a measurable space. A positive-operator valued measure (POVM) $A$ is a
mapping $A: \mathcal{B} \to \mathcal{L}(\mathcal{H})$ such that

(i) $A(E) \geq O$ for each $E \in \mathcal{B}$;

(ii) $A(\Omega) = I$;

(iii) for each countable disjoint family $\{E_i\} \subset \mathcal{B}$, $A(\bigcup_i E_i) = \sum_i A(E_i)$, where the RHS is
convergent in the sense of the weak operator topology.

Here, $O$ and $I$ are the zero and identity operators, respectively. The triple $(\Omega, \mathcal{B}, A)$ is also
called a POVM.
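In finite dimensions with finitely many outcomes, conditions (i) and (ii) can be checked directly on the operators $A(\{x\})$, and (iii) then holds by construction. The helper below is an illustrative sketch; the name `is_discrete_povm` and the tolerance are assumptions of this example, not part of the paper.

```python
import numpy as np

def is_discrete_povm(effects, tol=1e-10):
    """Check positivity of each element and completeness sum_x A(x) = I."""
    effects = [np.asarray(e, dtype=complex) for e in effects]
    hermitian = all(np.allclose(e, e.conj().T, atol=tol) for e in effects)
    positive = hermitian and all(np.linalg.eigvalsh(e).min() >= -tol for e in effects)
    complete = np.allclose(sum(effects), np.eye(effects[0].shape[0]), atol=tol)
    return bool(positive and complete)

projective = [np.diag([1.0, 0.0]), np.diag([0.0, 1.0])]   # a projective measurement
not_complete = [np.diag([1.0, 0.0]), np.diag([1.0, 0.0])]  # elements do not sum to I
```

With these inputs, `is_discrete_povm(projective)` holds and `is_discrete_povm(not_complete)` fails.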

Let $(\Omega, \mathcal{B}, A)$ be a POVM. For each $\rho \in \mathcal{S}(\mathcal{H})$ we define a probability measure $P_\rho^A$ with
the outcome space $(\Omega, \mathcal{B})$ by $P_\rho^A(E) := \mathrm{tr}[\rho A(E)]$ for each $E \in \mathcal{B}$. Then $A$ induces a natural
statistical model $\mathcal{P}^A := \{P_\rho^A\}_{\rho \in \mathcal{S}(\mathcal{H})}$, which is the set of possible outcome distributions when
we perform the measurement $A$.

From the separability of $\mathcal{H}$, there exists a sequence $\{\rho_i\}_{i \geq 1}$ in $\mathcal{S}(\mathcal{H})$ dense with respect
to the trace norm topology. Taking arbitrary $\{c_i\}_{i \geq 1}$ such that $c_i > 0$ and $\sum_{i \geq 1} c_i = 1$, e.g.
$c_i = 2^{-i}$, we define a state $\rho_* := \sum_{i \geq 1} c_i \rho_i$. Throughout this paper, we fix such $\{\rho_i\}_{i \geq 1}$ and
$\rho_*$.

From the definition of $\rho_*$, $P_{\rho_*}^A = \sum_{i \geq 1} c_i P_{\rho_i}^A$ and the following proposition immediately
follows.

Proposition 1. Let $(\Omega, \mathcal{B}, A)$ be a POVM. Then $\mathcal{P}^A \ll P_{\rho_*}^A$, i.e. $P_{\rho_*}^A$ is a pivotal measure
for $\mathcal{P}^A$.

Due to the above proposition, the notions of $A$-a.e., $\mathcal{P}^A$-a.e., and $P_{\rho_*}^A$-a.e. coincide, and
thus we will use them interchangeably.

C. Fuzzy preorder and equivalence relations among POVMs 

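Let $(\Omega_1, \mathcal{B}_1)$ and $(\Omega_2, \mathcal{B}_2)$ be measurable spaces. A mapping $\kappa(\cdot|\cdot): \mathcal{B}_1 \times \Omega_2 \to [0,1]$ is
called a regular Markov kernel if

(i) $\kappa(E|\cdot)$ is $\mathcal{B}_2$-measurable for each $E \in \mathcal{B}_1$;

(ii) $\kappa(\cdot|y)$ is a probability measure for each $y \in \Omega_2$.

Let $(\Omega_1, \mathcal{B}_1)$ be a measurable space and let $(\Omega_2, \mathcal{B}_2, A_2)$ be a POVM. An $A_2$-weak Markov
kernel is a mapping $\kappa(\cdot|\cdot): \mathcal{B}_1 \times \Omega_2 \to \mathbb{R}$ such that

(i) $\kappa(E|\cdot)$ is $\mathcal{B}_2$-measurable for each $E \in \mathcal{B}_1$;

(ii) $0 \leq \kappa(E|y) \leq 1$ $A_2$-a.e. for each $E \in \mathcal{B}_1$;

(iii) $\kappa(\Omega_1|y) = 1$ $A_2$-a.e. and $\kappa(\emptyset|y) = 0$ $A_2$-a.e.;

(iv) $\kappa(\bigcup_i E_i|y) = \sum_i \kappa(E_i|y)$ $A_2$-a.e. for each countable and disjoint $\{E_i\} \subset \mathcal{B}_1$.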

By using the concepts of the regular and weak Markov kernels, we introduce fuzzy preorder
and equivalence relations as follows.

Definition 1. Let $(\Omega_A, \mathcal{B}_A, A)$ and $(\Omega_B, \mathcal{B}_B, B)$ be POVMs.

(i) If there exists a regular Markov kernel $\kappa(\cdot|\cdot): \mathcal{B}_A \times \Omega_B \to [0,1]$ such that

$$A(E) = \int_{\Omega_B} \kappa(E|y)\, dB(y) \qquad (E \in \mathcal{B}_A), \tag{2}$$

then we say that $A$ is regularly fuzzier than $B$, denoted by $A \preceq_r B$.

(ii) If there exists a $B$-weak Markov kernel $\kappa(\cdot|\cdot): \mathcal{B}_A \times \Omega_B \to \mathbb{R}$ such that the condition (2)
holds, then we say that $A$ is weakly fuzzier than $B$, denoted by $A \preceq_w B$.

(iii) If $A \preceq_r B$ and $B \preceq_r A$, then $A$ and $B$ are said to be regularly fuzzy equivalent, denoted
by $A \sim_r B$.

(iv) If $A \preceq_w B$ and $B \preceq_w A$, then $A$ and $B$ are said to be weakly fuzzy equivalent, denoted
by $A \sim_w B$.


Intuitively the relations $A \preceq_r B$ and $A \preceq_w B$ mean that the measurement of $A$ can
be realized by a classical post-processing of the measurement of $B$. By definition, $A \preceq_r B$
(resp. $A \sim_r B$) implies $A \preceq_w B$ (resp. $A \sim_w B$). It is known that the regular and
weak relations $\sim_r$ and $\sim_w$ (resp. $\preceq_r$ and $\preceq_w$) are equivalence relations (resp. preorder
relations). The converse implication, from the weak relation $A \sim_w B$ to the regular relation $A \sim_r B$,
does not necessarily hold. See Appendix A for an explicit example of POVMs
that are weakly fuzzy equivalent but not regularly fuzzy equivalent.

A standard Borel space is a measurable space Borel isomorphic to a complete separable
metric space. We call a POVM with a standard Borel outcome space a standard Borel 
POVM. By further assuming the standard Borel properties of POVMs, the weak relations 
imply the corresponding regular relations as in the following proposition. 


Proposition 2 (Remark 4.1 of Ref. 8). Let $A$ and $B$ be POVMs.

1. If $A$ is standard Borel, $A \preceq_w B \Leftrightarrow A \preceq_r B$.

2. If $A$ and $B$ are standard Borel, $A \sim_w B \Leftrightarrow A \sim_r B$.


In the formulation of the minimal sufficient POVM, we will use the regular and weak 
fuzzy equivalence relations. 


D. Sufficient statistics for POVM 

In this subsection, we consider the sufficiency condition of a statistic for a POVM. We
also show Lemma 2, which will be used in Sec. III.

Let $(\Omega_A, \mathcal{B}_A, A)$ be a POVM and let $T \in \mathcal{M}((\Omega_A, \mathcal{B}_A) \to (\Omega_T, \mathcal{B}_T))$ be a statistic. We
define a POVM $A_T$ with the outcome space $(\Omega_T, \mathcal{B}_T)$ by $A_T(\cdot) := A(T^{-1}(\cdot))$. Since $A_T(\cdot)$
can be written as

$$A_T(F) = \int_{\Omega_A} \chi_F(T(x))\, dA(x) \qquad (F \in \mathcal{B}_T),$$

we have $A_T \preceq_r A$. $T$ is said to be a sufficient statistic for $A$ if $T$ is sufficient for the statistical
model $\mathcal{P}^A = \{P_\rho^A\}_{\rho \in \mathcal{S}(\mathcal{H})}$.

The following lemma, the monotonicity of the $f$-divergence, states that the $f$-divergence
is monotonically decreasing under classical post-processing.

Lemma 1 (Theorem 7.1 of Ref. 8). Let $A$ and $B$ be POVMs.

(i) $A \preceq_w B$ implies $D_f(P_\rho^A, P_\sigma^A) \leq D_f(P_\rho^B, P_\sigma^B)$ $(\forall \rho, \forall \sigma \in \mathcal{S}(\mathcal{H}))$.

(ii) $A \sim_w B$ implies $D_f(P_\rho^A, P_\sigma^A) = D_f(P_\rho^B, P_\sigma^B)$ $(\forall \rho, \forall \sigma \in \mathcal{S}(\mathcal{H}))$.

The next theorem characterizes the sufficiency of a statistic for a POVM. 

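Theorem 3. Let $(\Omega_A, \mathcal{B}_A, A)$ be a POVM and let $T \in \mathcal{M}((\Omega_A, \mathcal{B}_A) \to (\Omega_T, \mathcal{B}_T))$ be a
statistic. Then the following conditions are equivalent.

(i) $T$ is sufficient for $A$;

(ii) for each $\rho \in \mathcal{S}(\mathcal{H})$ there exists a $\mathcal{B}_T$-measurable map $g_\rho(\cdot)$ such that

$$\frac{dP_\rho^A}{dP_{\rho_*}^A}(x) = g_\rho(T(x)) \quad A\text{-a.e.};$$

(iii) for each $\rho \in \mathcal{S}(\mathcal{H})$,

$$\frac{dP_\rho^A}{dP_{\rho_*}^A}(x) = \frac{dP_\rho^{A_T}}{dP_{\rho_*}^{A_T}}(T(x)) \quad A\text{-a.e.};$$

(iv) for each $\rho, \sigma \in \mathcal{S}(\mathcal{H})$, $D_f(P_\rho^A, P_\sigma^A) = D_f(P_\rho^{A_T}, P_\sigma^{A_T})$;

(v) $A \sim_w A_T$.

Furthermore, if $A$ is a standard Borel POVM the above conditions are equivalent to

(vi) $A \sim_r A_T$.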

Proof. From Theorem 1 and Lemma 1 the equivalence (i)$\Leftrightarrow$(ii)$\Leftrightarrow$(iii)$\Leftrightarrow$(iv) immediately
follows.

Let us show (i)$\Rightarrow$(v). $A_T \preceq_r A$ is evident from the definition of $A_T$. Since $T$ is sufficient,
there exists a conditional probability $P(E|\cdot)$, which is $\mathcal{B}_T$-measurable for each $E \in \mathcal{B}_A$,
such that

$$P_\rho^A(E) = \int_{\Omega_T} P(E|t)\, dP_\rho^{A_T}(t).$$

Since $P(\cdot|\cdot)$ satisfies the conditions for the $A_T$-weak Markov kernel from the definition of
the conditional probability, we obtain $A \preceq_w A_T$, and thus (v) holds.

(v)$\Rightarrow$(iv) follows from Lemma 1.

Let us assume $A$ is a standard Borel POVM. If we assume (v), from Proposition 2 we have
$A \preceq_r A_T$. Since $A_T \preceq_r A$ by definition, we have proved (v)$\Rightarrow$(vi). The converse (vi)$\Rightarrow$(v) is
evident. □

The following lemma assures the existence of a POVM corresponding to the joint distribution
for a given POVM and a regular Markov kernel.

Lemma 2. Let $(\Omega_1, \mathcal{B}_1)$ be a measurable space and let $(\Omega_2, \mathcal{B}_2, A_2)$ be a POVM. Let
$\kappa(\cdot|\cdot): \mathcal{B}_1 \times \Omega_2 \to [0,1]$ be a regular Markov kernel. Then the following assertions hold.

1. There exists a unique POVM $A_{12}$ with the product outcome space $(\Omega_1 \times \Omega_2, \mathcal{B}_1 \times \mathcal{B}_2)$
such that

$$A_{12}(E_1 \times E_2) = \int_{E_2} \kappa(E_1|y)\, dA_2(y) \qquad (E_1 \in \mathcal{B}_1,\ E_2 \in \mathcal{B}_2). \tag{3}$$

2. The canonical projection

$$\pi_2: \Omega_1 \times \Omega_2 \ni (x, y) \mapsto y \in \Omega_2$$

is a sufficient statistic for $A_{12}$.

3. $A_2 = (A_{12})_{\pi_2} \sim_r A_{12}$.

4. For each $\rho \in \mathcal{S}(\mathcal{H})$,

$$\frac{dP_\rho^{A_{12}}}{dP_{\rho_*}^{A_{12}}}(x, y) = \frac{dP_\rho^{A_2}}{dP_{\rho_*}^{A_2}}(y) \quad A_{12}\text{-a.e.}$$

Proof. 1. Let us consider the following mapping

$$\tilde{\kappa}(F|y) := \kappa(F|_y \,|\, y) \qquad (F \in \mathcal{B}_1 \times \mathcal{B}_2,\ y \in \Omega_2),$$

$$F|_y := \{ x \in \Omega_1 \mid (x, y) \in F \}.$$

Then $\tilde{\kappa}(\cdot|y)$ is a probability measure for each $y \in \Omega_2$. To show the measurability of
$\tilde{\kappa}(F|\cdot)$ for each $F \in \mathcal{B}_1 \times \mathcal{B}_2$, let us define a class

$$\mathcal{D} := \{ F \in \mathcal{B}_1 \times \mathcal{B}_2 \mid \tilde{\kappa}(F|\cdot) \text{ is } \mathcal{B}_2\text{-measurable} \}.$$

Then $\mathcal{D}$ is a Dynkin class, i.e. $\mathcal{D}$ contains $\Omega_1 \times \Omega_2$ and is closed under proper
differences and countable disjoint unions. Since

$$\tilde{\kappa}(E_1 \times E_2|y) = \kappa(E_1|y)\, \chi_{E_2}(y) \qquad (E_1 \in \mathcal{B}_1,\ E_2 \in \mathcal{B}_2), \tag{4}$$

$\mathcal{D}$ contains the family of cylinder sets $\mathcal{C} := \{ E_1 \times E_2 \mid E_1 \in \mathcal{B}_1,\ E_2 \in \mathcal{B}_2 \}$, which
is a multiplicative class. Then the Dynkin class theorem assures that $\mathcal{D} = \mathcal{B}_1 \times \mathcal{B}_2$,
i.e. $\tilde{\kappa}(\cdot|\cdot)$ is a regular Markov kernel. Let us define a POVM $A_{12}$ by

$$A_{12}(F) := \int_{\Omega_2} \tilde{\kappa}(F|y)\, dA_2(y) \qquad (F \in \mathcal{B}_1 \times \mathcal{B}_2). \tag{5}$$

Then from Eq. (4), $A_{12}$ satisfies the desired condition (3). The uniqueness of $A_{12}$
immediately follows from the Dynkin class theorem.

2. Let us denote the conditional probability of $P_\rho^{A_{12}}$ for given $\pi_2 = y$ by $P_\rho^{A_{12}}(F|y)$
$(F \in \mathcal{B}_1 \times \mathcal{B}_2)$. Then for each $E_1 \in \mathcal{B}_1$ and $E_2, E_2' \in \mathcal{B}_2$, we have

$$P_\rho^{A_{12}}(E_1 \times (E_2 \cap E_2')) = \int_{E_2'} \kappa(E_1|y)\, \chi_{E_2}(y)\, dP_\rho^{A_2}(y) = \int_{E_2'} P_\rho^{A_{12}}(E_1 \times E_2|y)\, dP_\rho^{A_2}(y).$$

Thus we have

$$P_\rho^{A_{12}}(E_1 \times E_2|y) = \kappa(E_1|y)\, \chi_{E_2}(y) \quad P_\rho^{A_2}\text{-a.e.}$$

and

$$P_{\rho_*}^{A_{12}}(E_1 \times E_2|y) = \kappa(E_1|y)\, \chi_{E_2}(y) \quad A_2\text{-a.e.}$$

Therefore we obtain

$$P_\rho^{A_{12}}(E_1 \times E_2|y) = P_{\rho_*}^{A_{12}}(E_1 \times E_2|y) \quad P_\rho^{A_2}\text{-a.e.}$$

To show that $P_\rho^{A_{12}}(F|y)$ can be taken independent of $\rho$, we define a class

$$\mathcal{D}_\rho := \{ F \in \mathcal{B}_1 \times \mathcal{B}_2 \mid P_\rho^{A_{12}}(F|y) = P_{\rho_*}^{A_{12}}(F|y) \ \ P_\rho^{A_2}\text{-a.e.} \}.$$

Then $\mathcal{D}_\rho$ is a Dynkin class that contains the family of cylinder sets $\mathcal{C}$. Therefore
the Dynkin class theorem assures that $P_\rho^{A_{12}}(F|y) = P_{\rho_*}^{A_{12}}(F|y)$ $P_\rho^{A_2}$-a.e. for each $F \in
\mathcal{B}_1 \times \mathcal{B}_2$. This implies that $\pi_2$ is a sufficient statistic.

The assertion 3 is evident from Eq. (5) and $A_2 = (A_{12})_{\pi_2} \preceq_r A_{12}$.

The assertion 4 immediately follows from the assertion 3 and Theorem 3. □

III. MINIMAL SUFFICIENT POVM


In this section, we define a minimal sufficient POVM and show the existence and uniqueness
of a minimal sufficient POVM equivalent to a given POVM.

Definition 2. Let $(\Omega_A, \mathcal{B}_A, A)$ be a POVM.

1. $(\Omega_A, \mathcal{B}_A, A)$ is said to be $\sim_r$-minimal sufficient if for any POVM $(\Omega_B, \mathcal{B}_B, B)$ regularly
fuzzy equivalent to $A$ there exists a measurable map $f \in \mathcal{M}((\Omega_B, \mathcal{B}_B) \to (\Omega_A, \mathcal{B}_A))$
such that $B_f = A$.

2. $(\Omega_A, \mathcal{B}_A, A)$ is said to be $\sim_w$-minimal sufficient if for any POVM $(\Omega_B, \mathcal{B}_B, B)$ weakly
fuzzy equivalent to $A$ there exists a measurable map $f \in \mathcal{M}((\Omega_B, \mathcal{B}_B) \to (\Omega_A, \mathcal{B}_A))$
such that $B_f = A$.

Since $\sim_w$ is a relation less restrictive than $\sim_r$, any $\sim_w$-minimal sufficient POVM is a
$\sim_r$-minimal sufficient POVM by definition.

We remark that the measurable map $f$ in Definition 2 is a sufficient statistic for the
statistical model $\mathcal{P}^B$ due to Theorem 3 and $B_f = A \sim_w B$.

The minimal sufficiency of a POVM can be interpreted as a generalization of the minimal
sufficiency of a statistic in the sense that we consider a more general class of classical
post-processings which includes taking statistics. Intuitively, a minimal sufficient POVM $A$ is
the least redundant POVM among the POVMs fuzzy equivalent to $A$.


A. Lehmann-Scheffé-Bahadur statistic

Let $(\Omega_A, \mathcal{B}_A, A)$ be a POVM. Corresponding to Theorem 2 we define a statistic $T \in
\mathcal{M}((\Omega_A, \mathcal{B}_A) \to (\mathbb{R}^\infty, \mathcal{B}(\mathbb{R}^\infty)))$ by

$$T(x) := \left(\frac{dP_{\rho_i}^A}{dP_{\rho_*}^A}(x)\right)_{i \geq 1}. \tag{6}$$

After the authors of Refs. 3 and 4, we call the statistic (6) a Lehmann-Scheffé-Bahadur
(LSB) statistic for $A$.

The following theorem is the first main result of this paper. 


Theorem 4. Let $(\Omega_A, \mathcal{B}_A, A)$ be an arbitrary POVM and let $T$ be the LSB statistic given
by (6). Then the following assertions hold.

1. $T$ is a minimal sufficient statistic for $\mathcal{P}^A$.

2. $A_T$ is a $\sim_w$-minimal sufficient POVM, and therefore a $\sim_r$-minimal sufficient POVM.

Proof. 1. From Theorem 2, it is sufficient to show that $\{P_{\rho_i}^A\}_{i \geq 1}$ is dense in $\mathcal{P}^A$ with
respect to the metric $d(\cdot, \cdot)$ given by (1). For each $\rho, \sigma \in \mathcal{S}(\mathcal{H})$ and each $E \in \mathcal{B}_A$ we
have

$$|P_\rho^A(E) - P_\sigma^A(E)| = |\mathrm{tr}[(\rho - \sigma) A(E)]| \leq \|\rho - \sigma\|_{\mathrm{tr}} \|A(E)\| \leq \|\rho - \sigma\|_{\mathrm{tr}},$$

where $\|\cdot\|$ and $\|\cdot\|_{\mathrm{tr}}$ are the operator and trace norms, respectively. Thus we obtain

$$d(P_\rho^A, P_\sigma^A) \leq \|\rho - \sigma\|_{\mathrm{tr}}. \tag{7}$$

Since $\{\rho_i\}_{i \geq 1}$ is dense in $\mathcal{S}(\mathcal{H})$ with respect to the trace norm, the inequality (7) implies
that $\{P_{\rho_i}^A\}_{i \geq 1}$ is dense with respect to $d(\cdot, \cdot)$ in $\mathcal{P}^A$, and we have shown the minimal
sufficiency of $T$.


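2. From Theorem 3 and the sufficiency of $T$, we have

$$\frac{dP_{\rho_i}^A}{dP_{\rho_*}^A}(x) = \frac{dP_{\rho_i}^{A_T}}{dP_{\rho_*}^{A_T}}(T(x)) \quad A\text{-a.e.}$$

for each $i \geq 1$. From the definition of the LSB statistic (6), this implies

$$T(x) = \left(\frac{dP_{\rho_i}^{A_T}}{dP_{\rho_*}^{A_T}}(T(x))\right)_{i \geq 1} \quad A\text{-a.e.},$$

that is,

$$t = \left(\frac{dP_{\rho_i}^{A_T}}{dP_{\rho_*}^{A_T}}(t)\right)_{i \geq 1} \quad A_T\text{-a.e.} \tag{8}$$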

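To show the $\sim_w$-minimal sufficiency of $A_T$, let $(\Omega_B, \mathcal{B}_B, B)$ be an arbitrary POVM
weakly fuzzy equivalent to $A_T$. Since the outcome space $(\mathbb{R}^\infty, \mathcal{B}(\mathbb{R}^\infty))$ for $A_T$ is a
standard Borel space, there exists a regular Markov kernel $\kappa(\cdot|\cdot): \mathcal{B}(\mathbb{R}^\infty) \times \Omega_B \to [0,1]$
such that

$$A_T(F) = \int_{\Omega_B} \kappa(F|y)\, dB(y) \tag{9}$$

for each $F \in \mathcal{B}(\mathbb{R}^\infty)$. We define a POVM $(\mathbb{R}^\infty \times \Omega_B, \mathcal{B}(\mathbb{R}^\infty) \times \mathcal{B}_B, C)$ by

$$C(E) := \int_{\Omega_B} \kappa(E|_y \,|\, y)\, dB(y) \qquad (E \in \mathcal{B}(\mathbb{R}^\infty) \times \mathcal{B}_B).$$

Then from Lemma 2 we have $C \sim_r B = C_{\pi_B}$ and

$$\frac{dP_{\rho_i}^C}{dP_{\rho_*}^C}(t, y) = \frac{dP_{\rho_i}^B}{dP_{\rho_*}^B}(y) \quad C\text{-a.e.} \tag{10}$$

for each $i \geq 1$, where $\pi_B: \mathbb{R}^\infty \times \Omega_B \to \Omega_B$ is the canonical projection. On the other
hand, from Eq. (9) we have $C_{\pi_{A_T}} = A_T \sim_w B \sim_r C$, where $\pi_{A_T}: \mathbb{R}^\infty \times \Omega_B \to \mathbb{R}^\infty$ is
the canonical projection, and Theorem 3 assures that

$$\frac{dP_{\rho_i}^C}{dP_{\rho_*}^C}(t, y) = \frac{dP_{\rho_i}^{A_T}}{dP_{\rho_*}^{A_T}}(t) \quad C\text{-a.e.} \tag{11}$$

for each $i \geq 1$. From Eqs. (8), (10) and (11), we obtain

$$t = \left(\frac{dP_{\rho_i}^B}{dP_{\rho_*}^B}(y)\right)_{i \geq 1} =: f(y) \quad C\text{-a.e.}, \tag{12}$$

where $f \in \mathcal{M}((\Omega_B, \mathcal{B}_B) \to (\mathbb{R}^\infty, \mathcal{B}(\mathbb{R}^\infty)))$. Then for each $E \in \mathcal{B}(\mathbb{R}^\infty)$, we have

$$B_f(E) = B(f^{-1}(E)) = C(\mathbb{R}^\infty \times f^{-1}(E)) = \int_{\mathbb{R}^\infty \times \Omega_B} \chi_E(f(y))\, dC(t, y)$$

$$= \int_{\mathbb{R}^\infty \times \Omega_B} \chi_E(t)\, dC(t, y) \tag{13}$$

$$= A_T(E),$$

where we have used Eq. (12) in deriving the equality (13). Thus we obtain $B_f = A_T$,
and we have shown the $\sim_w$-minimal sufficiency of $A_T$. □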


B. Uniqueness up to almost isomorphism 

In order to formulate the uniqueness of the minimal sufficient POVM, we introduce a 
concept of almost isomorphism as follows. 

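Definition 3. Let $(\Omega_i, \mathcal{B}_i, A_i)$ $(i = 1, 2)$ be POVMs.

(i) A $\mathcal{B}_1/\mathcal{B}_2$-bimeasurable bijection $f: \Omega_1 \to \Omega_2$ is called a strict isomorphism if $(A_1)_f =
A_2$. If there exists such a strict isomorphism, $(\Omega_1, \mathcal{B}_1, A_1)$ and $(\Omega_2, \mathcal{B}_2, A_2)$ are said
to be strictly isomorphic, denoted by $(\Omega_1, \mathcal{B}_1, A_1) \approx (\Omega_2, \mathcal{B}_2, A_2)$.

(ii) $(\Omega_1, \mathcal{B}_1, A_1)$ and $(\Omega_2, \mathcal{B}_2, A_2)$ are said to be almost isomorphic, written $(\Omega_1, \mathcal{B}_1, A_1) \sim
(\Omega_2, \mathcal{B}_2, A_2)$, if there exist measurable subsets $\tilde{\Omega}_i \in \mathcal{B}_i$ $(i = 1, 2)$ such that $A_i(\tilde{\Omega}_i) =
I$ and the restrictions $(\tilde{\Omega}_i, \tilde{\Omega}_i \cap \mathcal{B}_i, A_i|_{\tilde{\Omega}_i})$ $(i = 1, 2)$ are strictly isomorphic. Here

$$\tilde{\Omega}_i \cap \mathcal{B}_i := \{ \tilde{\Omega}_i \cap E \mid E \in \mathcal{B}_i \}$$

and $A_i|_{\tilde{\Omega}_i}$ is the restriction of $A_i$ to $\tilde{\Omega}_i \cap \mathcal{B}_i$. We call
a measurable subset $\tilde{\Omega}_i$ with $A_i(\tilde{\Omega}_i) = I$ a full measure set.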


The above definition is a straightforward generalization of the corresponding concepts
known for classical measures (Refs. 17 and 18). The relations $\approx$ and $\sim$ are equivalence relations, which
can be proved in a similar manner as for classical measures (e.g. Sec. 2.4 of Ref. 17).
Intuitively, these concepts correspond to the relabeling of the measurement outcomes.

The following proposition gives the relationship between these isomorphisms and the 
fuzzy equivalence relation. 


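Proposition 3. Let $(\Omega_A, \mathcal{B}_A, A)$ and $(\Omega_B, \mathcal{B}_B, B)$ be POVMs. Then the following implications
hold.

$$(\Omega_A, \mathcal{B}_A, A) \approx (\Omega_B, \mathcal{B}_B, B) \ \Rightarrow\ (\Omega_A, \mathcal{B}_A, A) \sim (\Omega_B, \mathcal{B}_B, B) \ \Rightarrow\ A \sim_r B.$$

Proof. The first implication is evident from the definitions of the strict and almost isomorphisms.
Let us assume $(\Omega_A, \mathcal{B}_A, A) \sim (\Omega_B, \mathcal{B}_B, B)$ and show $A \sim_r B$. Let $\tilde{\Omega}_A \in \mathcal{B}_A$ and
$\tilde{\Omega}_B \in \mathcal{B}_B$ be full measure sets such that $(\tilde{\Omega}_A, \tilde{\Omega}_A \cap \mathcal{B}_A, A|_{\tilde{\Omega}_A}) \approx (\tilde{\Omega}_B, \tilde{\Omega}_B \cap \mathcal{B}_B, B|_{\tilde{\Omega}_B})$. We
first show $A \sim_r A|_{\tilde{\Omega}_A}$. Since the identity map

$$\iota: \tilde{\Omega}_A \ni x \mapsto x \in \Omega_A$$

is $\tilde{\Omega}_A \cap \mathcal{B}_A / \mathcal{B}_A$-measurable, we have $A = (A|_{\tilde{\Omega}_A})_\iota \preceq_r A|_{\tilde{\Omega}_A}$. We fix a point $x_0 \in \tilde{\Omega}_A$ and
define a mapping $\tilde{\iota}: \Omega_A \to \tilde{\Omega}_A$ by

$$\tilde{\iota}(x) := \begin{cases} x & (x \in \tilde{\Omega}_A); \\ x_0 & (x \in \Omega_A \setminus \tilde{\Omega}_A). \end{cases} \tag{14}$$

Then for each $E \in \tilde{\Omega}_A \cap \mathcal{B}_A$ we have

$$\tilde{\iota}^{-1}(E) = \begin{cases} E \cup (\Omega_A \setminus \tilde{\Omega}_A) & (x_0 \in E); \\ E & (x_0 \notin E). \end{cases}$$

Hence $\tilde{\iota}$ is $\mathcal{B}_A / \tilde{\Omega}_A \cap \mathcal{B}_A$-measurable and $A|_{\tilde{\Omega}_A} = A_{\tilde{\iota}} \preceq_r A$. Thus we have shown $A \sim_r A|_{\tilde{\Omega}_A}$.

Similarly we can prove $B \sim_r B|_{\tilde{\Omega}_B}$. Since the equivalence $A|_{\tilde{\Omega}_A} \sim_r B|_{\tilde{\Omega}_B}$ immediately
follows from the definition of the strict isomorphism, we obtain $A \sim_r A|_{\tilde{\Omega}_A} \sim_r B|_{\tilde{\Omega}_B} \sim_r
B$. □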

We next show that the minimal sufficiency condition for a POVM is invariant under 
almost isomorphism. 

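Proposition 4. Let $(\Omega_A, \mathcal{B}_A, A)$ be a $\sim_w$-minimal sufficient (resp. $\sim_r$-minimal sufficient)
POVM and let $(\Omega_B, \mathcal{B}_B, B)$ be a POVM almost isomorphic to $(\Omega_A, \mathcal{B}_A, A)$. Then $B$ is also
$\sim_w$-minimal sufficient (resp. $\sim_r$-minimal sufficient).

Proof. We prove the assertion for the $\sim_w$-minimal sufficient POVM; the proof for $\sim_r$-minimal
sufficiency can be obtained by replacing $\sim_w$ with $\sim_r$ in the following proof.

Let $\tilde{\Omega}_A \in \mathcal{B}_A$ and $\tilde{\Omega}_B \in \mathcal{B}_B$ be full measure sets such that $(\tilde{\Omega}_A, \tilde{\Omega}_A \cap \mathcal{B}_A, A|_{\tilde{\Omega}_A}) \approx
(\tilde{\Omega}_B, \tilde{\Omega}_B \cap \mathcal{B}_B, B|_{\tilde{\Omega}_B})$ by a strict isomorphism $f: \tilde{\Omega}_A \to \tilde{\Omega}_B$. To show the $\sim_w$-minimal
sufficiency of $B$, we take an arbitrary POVM $(\Omega_C, \mathcal{B}_C, C)$ weakly fuzzy equivalent to $B$. Then
from Proposition 3, $C$ is also weakly fuzzy equivalent to $A$ and there exists a mapping
$g \in \mathcal{M}((\Omega_C, \mathcal{B}_C) \to (\Omega_A, \mathcal{B}_A))$ such that $C_g = A$. By fixing a point $x_0 \in \tilde{\Omega}_A$ we define a
mapping $\tilde{\iota}: \Omega_A \to \tilde{\Omega}_A$ by Eq. (14). Then $\tilde{\iota}$ is a measurable map such that $A|_{\tilde{\Omega}_A} = A_{\tilde{\iota}}$. Let us
denote by $\tilde{f}$ the isomorphism $f$ regarded as a map from $\tilde{\Omega}_A$ to $\Omega_B$. Then $\tilde{f}$ is $\tilde{\Omega}_A \cap \mathcal{B}_A / \mathcal{B}_B$-measurable
and $(A|_{\tilde{\Omega}_A})_{\tilde{f}} = B$. Thus we have $B = (A|_{\tilde{\Omega}_A})_{\tilde{f}} = (A_{\tilde{\iota}})_{\tilde{f}} = ((C_g)_{\tilde{\iota}})_{\tilde{f}} = C_{\tilde{f} \circ \tilde{\iota} \circ g}$, and
$B$ is $\sim_w$-minimal sufficient. □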

Now we prove the following theorem which is the second main result of this paper. 

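Theorem 5. 1. Let $(\Omega_A, \mathcal{B}_A, A)$ be an arbitrary POVM. Then there exists a $\sim_w$-minimal
sufficient POVM $\tilde{A}$ weakly fuzzy equivalent to $A$. Furthermore such $\tilde{A}$ is unique up to
almost isomorphism.

2. Let $(\Omega_A, \mathcal{B}_A, A)$ be a standard Borel POVM. Then there exists a $\sim_r$-minimal sufficient
POVM $\tilde{A}$ regularly fuzzy equivalent to $A$. Furthermore such $\tilde{A}$ is unique up to almost
isomorphism.

Proof. 1. Let $T: \Omega_A \to \mathbb{R}^\infty$ be the LSB statistic given by (6) and define $\tilde{A} := A_T$. Then
from Theorem 3 and Theorem 4, $\tilde{A}$ is a $\sim_w$-minimal sufficient POVM weakly fuzzy
equivalent to $A$. From the same discussion as in Theorem 4, $\tilde{A}$ satisfies the following
condition:

$$t = \left(\frac{dP_{\rho_i}^{\tilde{A}}}{dP_{\rho_*}^{\tilde{A}}}(t)\right)_{i \geq 1} \quad \tilde{A}\text{-a.e.} \tag{15}$$

To prove the uniqueness up to almost isomorphism, take another $\sim_w$-minimal sufficient
POVM $(\Omega_B, \mathcal{B}_B, B)$ weakly fuzzy equivalent to $A$. Since $B \sim_w A \sim_w \tilde{A}$, there exist
mappings $f \in \mathcal{M}((\mathbb{R}^\infty, \mathcal{B}(\mathbb{R}^\infty)) \to (\Omega_B, \mathcal{B}_B))$ and $g \in \mathcal{M}((\Omega_B, \mathcal{B}_B) \to (\mathbb{R}^\infty, \mathcal{B}(\mathbb{R}^\infty)))$
such that $\tilde{A}_f = B$ and $B_g = \tilde{A}$. Then we have $\tilde{A}_{g \circ f} = \tilde{A}$, and from Theorem 3,
$g \circ f: \mathbb{R}^\infty \to \mathbb{R}^\infty$ is a sufficient statistic for $\tilde{A}$ and we have

$$\frac{dP_{\rho_i}^{\tilde{A}}}{dP_{\rho_*}^{\tilde{A}}}(t) = \frac{dP_{\rho_i}^{\tilde{A}}}{dP_{\rho_*}^{\tilde{A}}}(g \circ f(t)) \quad \tilde{A}\text{-a.e.} \tag{16}$$

for each $i \geq 1$. From Eqs. (15) and (16), we obtain

$$t = \left(\frac{dP_{\rho_i}^{\tilde{A}}}{dP_{\rho_*}^{\tilde{A}}}(t)\right)_{i \geq 1} = \left(\frac{dP_{\rho_i}^{\tilde{A}}}{dP_{\rho_*}^{\tilde{A}}}(g \circ f(t))\right)_{i \geq 1} = g \circ f(t) \quad \tilde{A}\text{-a.e.}$$

Thus there exists a full measure set $\tilde{\Omega}_{\tilde{A}} \in \mathcal{B}(\mathbb{R}^\infty)$ such that $(g \circ f)|_{\tilde{\Omega}_{\tilde{A}}}$ is the identity
map on $\tilde{\Omega}_{\tilde{A}}$. Then $\tilde{\Omega}_B := [\ldots] \in \mathcal{B}_B$ is a full measure set and $f|_{\tilde{\Omega}_{\tilde{A}}}$
is a strict isomorphism between $(\tilde{\Omega}_{\tilde{A}}, \tilde{\Omega}_{\tilde{A}} \cap \mathcal{B}(\mathbb{R}^\infty), \tilde{A}|_{\tilde{\Omega}_{\tilde{A}}})$ and $(\tilde{\Omega}_B, \tilde{\Omega}_B \cap \mathcal{B}_B, B|_{\tilde{\Omega}_B})$.
Therefore $\tilde{A}$ and $B$ are almost isomorphic.

2. As shown in 1, the POVM $\tilde{A}$ induced by the LSB statistic for $A$ is a $\sim_w$-minimal
sufficient POVM, and therefore a $\sim_r$-minimal sufficient POVM, weakly fuzzy equivalent
to $A$. Since $A$ and $\tilde{A}$ are standard Borel POVMs, they are regularly fuzzy equivalent.
The uniqueness up to almost isomorphism can be shown in a similar manner. □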


IV. MINIMAL SUFFICIENCY FOR DISCRETE POVM 

In this section, we consider the minimal sufficient condition for discrete POVMs. 

A measurable space $(\Omega, \mathcal{B})$ is said to be discrete if $\Omega$ is a countable set and $\mathcal{B}$ is the power
set $\mathfrak{P}(\Omega)$ of $\Omega$. A POVM $A$ is said to be discrete if the outcome space of $A$ is a discrete
space. A discrete POVM $(\Omega_A, \mathfrak{P}(\Omega_A), A)$ induces a mapping $A: \Omega_A \to \mathcal{L}(\mathcal{H})$ defined by
$A(x) := A(\{x\}) \geq O$ with the completeness condition

$$\sum_{x \in \Omega_A} A(x) = I. \tag{17}$$

Conversely, a positive-operator valued mapping $A: \Omega_A \to \mathcal{L}(\mathcal{H})$ with the completeness
condition (17) induces a discrete POVM by

$$A(E) := \sum_{x \in E} A(x)$$

for each $E \in \mathfrak{P}(\Omega_A)$. Thus throughout this section we identify a positive-operator valued
mapping $A: \Omega_A \to \mathcal{L}(\mathcal{H})$ satisfying the completeness condition with a discrete POVM. We
write $P_\rho^A(x) := \mathrm{tr}[\rho A(x)]$ for each $\rho \in \mathcal{S}(\mathcal{H})$.

A discrete POVM $A: \Omega_A \to \mathcal{L}(\mathcal{H})$ is said to be non-vanishing if $A(x) \neq O$ for all $x \in \Omega_A$.
Any discrete POVM is almost isomorphic to a non-vanishing POVM.

Since a discrete space is a standard Borel space, the weak relations $\preceq_w$ and $\sim_w$ coincide
with the regular relations $\preceq_r$ and $\sim_r$, respectively. Thus for discrete POVMs, the fuzzy preorder
and equivalence relations are denoted by $\preceq$ and $\sim$, respectively.

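For discrete POVMs $A: \Omega_A \to \mathcal{L}(\mathcal{H})$ and $B: \Omega_B \to \mathcal{L}(\mathcal{H})$, strict isomorphism, almost isomorphism and the fuzzy preorder
simplify as follows. $A$ and $B$ are strictly isomorphic if and only if there exists a bijection $f: \Omega_A \to \Omega_B$ such that
$A_f = B$, where $A_f(y) = \sum_{x \in \Omega_A} \delta_{y, f(x)} A(x)$. $A$ and $B$ are almost isomorphic if and only if there exist full-measure
subsets $\tilde{\Omega}_A \subset \Omega_A$ and $\tilde{\Omega}_B \subset \Omega_B$ such that the restrictions $A|_{\tilde{\Omega}_A}$ and $B|_{\tilde{\Omega}_B}$ are non-vanishing
and strictly isomorphic. $A \preceq B$ if and only if there exists a matrix $\{\kappa(x|y)\}_{(x,y) \in \Omega_A \times \Omega_B}$ such
that

$$\kappa(x|y) \geq 0, \qquad \sum_{x \in \Omega_A} \kappa(x|y) = 1, \tag{18}$$

$$A(x) = \sum_{y \in \Omega_B} \kappa(x|y) B(y).$$

A matrix $\kappa(\cdot|\cdot)$ satisfying the condition (18) is called a Markov matrix.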
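The discrete fuzzy preorder is easy to experiment with numerically. The following sketch assumes a qubit; the three-outcome POVM `B` and the Markov matrix `kappa` are hypothetical examples, and `post_process` is a helper name introduced here. It forms $A(x) = \sum_y \kappa(x|y) B(y)$, which by construction satisfies $A \preceq B$.

```python
import numpy as np

def post_process(kappa, B):
    """Return A(x) = sum_y kappa[x, y] * B(y) for a Markov matrix kappa."""
    kappa = np.asarray(kappa, float)
    # Condition (18): non-negative entries and unit column sums.
    assert np.all(kappa >= 0) and np.allclose(kappa.sum(axis=0), 1.0)
    return np.einsum("xy,yij->xij", kappa, np.asarray(B))

# Hypothetical three-outcome qubit POVM with rank-one elements.
angles = 2 * np.pi * np.arange(3) / 3
B = np.array([(1 / 3) * np.array([[1 + np.cos(a), np.sin(a)],
                                  [np.sin(a), 1 - np.cos(a)]]) for a in angles])

# kappa[x, y] = kappa(x|y): outcome y=1 of B is randomly assigned to x=0 or x=1.
kappa = np.array([[1.0, 0.5, 0.0],
                  [0.0, 0.5, 1.0]])
A = post_process(kappa, B)
```

The resulting `A` is again a discrete POVM: its elements are positive and sum to the identity.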

The following proposition characterizes the sufficiency condition of a statistic for a discrete 
POVM. 

Proposition 5. Let $A: \Omega_A \to \mathcal{L}(\mathcal{H})$ be a discrete POVM and let $T: \Omega_A \to \Omega_T$ be a mapping
to a measurable space $(\Omega_T, \mathcal{B}_T)$. We assume that $\mathcal{B}_T$ contains each single point set $\{t\}$
$(t \in \Omega_T)$. Then the following conditions are equivalent.

(i) $T$ is sufficient for $A$;

(ii) there exist functions $h(\cdot): \Omega_A \ni x \mapsto h(x) \in [0, \infty)$ and $g_\cdot(\cdot): \mathcal{S}(\mathcal{H}) \times \Omega_T \ni (\rho, t) \mapsto
g_\rho(t) \in [0, \infty)$ such that

$$P_\rho^A(x) = h(x)\, g_\rho(T(x)) \qquad (\rho \in \mathcal{S}(\mathcal{H}),\ x \in \Omega_A);$$

(iii) there exist functions $h(\cdot): \Omega_A \ni x \mapsto h(x) \in [0, \infty)$ and $G(\cdot): \Omega_T \ni t \mapsto G(t) \in \mathcal{L}_+(\mathcal{H})$
such that

$$A(x) = h(x)\, G(T(x)) \qquad (x \in \Omega_A),$$

where $\mathcal{L}_+(\mathcal{H}) := \{ a \in \mathcal{L}(\mathcal{H}) \mid a \geq O \}$;

(iv) $A \sim A_T$.

Proof. The equivalence (i)$\Leftrightarrow$(ii)$\Leftrightarrow$(iv) is evident from Theorem 3. The implication (iii)$\Rightarrow$(ii)
immediately follows by putting $g_\rho(t) := \mathrm{tr}[\rho G(t)]$. Let us assume (ii) and show (iii). If $t \in \Omega_T$
satisfies

$$\exists x \in \Omega_A \text{ such that } T(x) = t \text{ and } h(x) \neq 0, \tag{19}$$

then we have

$$g_\rho(t) = P_\rho^A(x)/h(x). \tag{20}$$

Since the RHS of Eq. (20) is affine and positive with respect to $\rho \in \mathcal{S}(\mathcal{H})$, according to
Refs. 12 and 19, there exists $G(t) \in \mathcal{L}_+(\mathcal{H})$ such that $g_\rho(t) = \mathrm{tr}[\rho G(t)]$ for any $\rho \in \mathcal{S}(\mathcal{H})$. For
$t \in \Omega_T$ that does not satisfy the condition (19), we define $G(t) = O$. Then we have

$$\mathrm{tr}[\rho A(x)] = P_\rho^A(x) = \mathrm{tr}[\rho\, h(x) G(T(x))]$$

for each $\rho \in \mathcal{S}(\mathcal{H})$ and $x \in \Omega_A$, which implies the condition (iii). □


Corresponding to Theorem [5] we have the following theorem as to the existence and 
uniqueness of a minimal sufficient POVM. 

Theorem 6. Let A: 12a — > C(fK) be a discrete POVM. Then there exists a discrete non¬ 
vanishing ~ w -minimal sufficient POVM A fuzzy equivalent to A. Furthermore such A is 
unique up to strict isomorphism. 

Proof. Since $A$ is almost isomorphic to a non-vanishing POVM, without loss of generality
we can assume $A$ is non-vanishing. We define an equivalence relation $\sim_A$ on $\Omega_A$ by

$$x \sim_A x' \;:\Leftrightarrow\; \exists c > 0,\ A(x) = cA(x'),$$

and define a mapping $S\colon \Omega_A \to \Omega_A/{\sim_A} =: \Omega_S$ by $S(x) := [x]$, where $[x]$ is the equivalence
class to which $x$ belongs. Then from the definition of $\sim_A$ we can write

$$A(x) = h(x)\, G(S(x))$$

for each $x \in \Omega_A$, where $h(x) > 0$ and $O \neq G(s) \in \mathcal{L}_+(\mathcal{H})$ ($s \in \Omega_S$), which implies $S$ is
sufficient for $A$. Therefore if we define $\tilde{A} := A_S$, then $\tilde{A} \simeq A$ and $\tilde{A}$ is a non-vanishing discrete
POVM.

From Theorem 0 and its proof, the LSB statistic $T\colon \Omega_S \to \mathbb{R}^\infty$ defined by (JOJ) induces a
$\simeq_w$-minimal sufficient POVM $(\mathbb{R}^\infty, \mathscr{B}(\mathbb{R}^\infty), \tilde{A}_T)$ fuzzy equivalent to $\tilde{A}$. Since $T$ is sufficient
for $\tilde{A}$, from Proposition 5 we can write

$$\tilde{A}(s) = h'(s)\, G'(T(s)),$$

where $h'(s) > 0$ and $O \neq G'(t) \in \mathcal{L}_+(\mathcal{H})$ ($t \in \mathbb{R}^\infty$). From the construction of $S$, this
implies that $T$ is injective: if $T(s) = T(s')$, then $\tilde{A}(s)$ and $\tilde{A}(s')$ are positive multiples of the same
operator, which forces $s = s'$. Hence $T$ is a strict isomorphism between $(\Omega_S, \mathcal{P}(\Omega_S), \tilde{A})$ and
$(T(\Omega_S), T(\Omega_S) \cap \mathscr{B}(\mathbb{R}^\infty), (\tilde{A}_T)|_{T(\Omega_S)})$. Note that $T(\Omega_S) \in \mathscr{B}(\mathbb{R}^\infty)$ since $T(\Omega_S)$ is a countable
set. Thus from Proposition U $\tilde{A}$ is a $\simeq_w$-minimal sufficient POVM.

To show the uniqueness, let $B\colon \Omega_B \to \mathcal{L}(\mathcal{H})$ be an arbitrary $\simeq_w$-minimal sufficient and
non-vanishing discrete POVM equivalent to $A$. Since $\tilde{A} \simeq B$, there exists a mapping
$f\colon \Omega_S \to \Omega_B$ such that $\tilde{A}_f = B$. If $f$ is not surjective, there exists an element $y \in \Omega_B \setminus f(\Omega_S)$
and we have $B(y) = \tilde{A}_f(y) = \sum_{x\colon f(x) = y} \tilde{A}(x) = O$, which contradicts the non-vanishing
property of $B$. Thus $f$ is surjective. Since $f$ is sufficient for $\tilde{A}$, by the same argument as for
$T$, we can show that $f$ is injective. Therefore $f$ is a strict isomorphism between $\tilde{A}$ and $B$,
which completes the proof. □
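The construction of $S$ in the proof of Theorem 6 amounts to merging all outcomes whose POVM elements are positive multiples of each other. The following is a minimal sketch under stated assumptions: the outcome set is finite, the elements are positive semidefinite matrices with exact rational entries (so a zero trace means the element is $O$), and proportionality classes are identified by the trace-normalized operator. The function name `minimal_sufficient` and the example values are hypothetical.

```python
from fractions import Fraction as F

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

def add(a, b):
    return tuple(tuple(p + q for p, q in zip(ra, rb)) for ra, rb in zip(a, b))

def scale(c, m):
    return tuple(tuple(c * e for e in row) for row in m)

def minimal_sufficient(povm):
    """Merge proportional elements; return the statistic S and the merged elements.

    Assumes every element is positive semidefinite, so trace 0 means the zero operator.
    """
    classes = {}   # trace-normalized operator -> class index s
    S = {}         # statistic S: outcome x -> class index
    merged = []    # merged[s] = sum of A(x) over the class s
    for x, m in povm.items():
        tr = trace(m)
        if tr == 0:
            continue                      # drop vanishing elements (non-vanishing reduction)
        key = tuple(tuple(F(e) / tr for e in row) for row in m)
        if key not in classes:
            classes[key] = len(merged)
            merged.append(m)
        else:
            s = classes[key]
            merged[s] = add(merged[s], m)
        S[x] = classes[key]
    return S, merged

# Illustrative data (assumed values): the POVM B of the Introduction with lambda = 1/3.
lam = F(1, 3)
A0 = ((F(3, 4), F(0)), (F(0), F(1, 4)))
A1 = ((F(1, 4), F(0)), (F(0), F(3, 4)))
B = {(0, 0): scale(lam, A0), (0, 1): scale(1 - lam, A0),
     (1, 0): scale(lam, A1), (1, 1): scale(1 - lam, A1)}

S, merged = minimal_sufficient(B)
```

For this example the four-outcome POVM $B$ collapses back to the two-outcome POVM $\{A_0, A_1\}$, which is what the theorem leads one to expect when $A_0$ and $A_1$ are not proportional.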

A discrete POVM $A\colon \Omega_A \to \mathcal{L}(\mathcal{H})$ is said to be pairwise linearly independent if any pair
$\{A(x), A(x')\}$ ($x \neq x'$) is linearly independent. A pairwise linearly independent POVM is
non-vanishing by definition. The following theorem states that minimal sufficiency and
pairwise linear independence are almost equivalent.

Theorem 7. Let $A\colon \Omega_A \to \mathcal{L}(\mathcal{H})$ be a discrete POVM. Then the following conditions are
equivalent:

(i) $A$ is pairwise linearly independent;

(ii) $A$ is non-vanishing and $\simeq_w$-minimal sufficient;

(iii) $A$ is non-vanishing and $\simeq_r$-minimal sufficient.

Proof. We first show (i)$\Rightarrow$(ii). Assume that $A$ is a pairwise linearly independent POVM.
Then, if we consider the mapping $S$ in the proof of Theorem 6, $S$ is an injection and the
POVM $\tilde{A}$ induced by $S$ is strictly isomorphic to $A$. Since $\tilde{A}$ is $\simeq_w$-minimal sufficient, $A$ is
also $\simeq_w$-minimal sufficient. Thus we have shown (i)$\Rightarrow$(ii).

(ii)$\Rightarrow$(iii) is obvious.

If $A$ is non-vanishing and $\simeq_r$-minimal sufficient, then $\tilde{A}$ induced by the statistic $S$ is
strictly isomorphic to $A$ by the uniqueness of the minimal sufficient POVM. Since $\tilde{A}$ is
pairwise linearly independent, $A$ is also pairwise linearly independent. □
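Condition (i) of Theorem 7 is easy to test for finite examples. The sketch below is an illustration only, with hypothetical helper names: two matrices are treated as vectors of their entries and declared linearly dependent when all $2 \times 2$ minors of the resulting pair vanish. A zero element is dependent on every other element, which matches the remark that a pairwise linearly independent POVM is non-vanishing.

```python
from fractions import Fraction as F
from itertools import combinations

def flat(m):
    return [e for row in m for e in row]

def linearly_dependent(a, b):
    """True if the matrices a and b are linearly dependent (all 2x2 minors vanish)."""
    u, v = flat(a), flat(b)
    n = len(u)
    return all(u[i] * v[j] == u[j] * v[i] for i in range(n) for j in range(i + 1, n))

def pairwise_linearly_independent(elements):
    return not any(linearly_dependent(a, b) for a, b in combinations(elements, 2))

q, h = F(1, 4), F(1, 2)
# Assumed example: a four-element qubit POVM whose elements sum to the identity.
independent_povm = [
    ((h, 0), (0, 0)),
    ((0, 0), (0, h)),
    ((q, q), (q, q)),
    ((q, -q), (-q, q)),
]
# Assumed example: two elements are proportional (a randomized copy of one outcome).
redundant_povm = [
    ((F(1, 3), 0), (0, 0)),
    ((F(2, 3), 0), (0, 0)),
    ((0, 0), (0, 1)),
]

ok = pairwise_linearly_independent(independent_povm)
bad = pairwise_linearly_independent(redundant_povm)
```

Here `ok` is true and `bad` is false; by Theorem 7, the first POVM is therefore minimal sufficient, while the second can still be reduced by merging its first two outcomes.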

In Ref. y it is shown that for each discrete POVM $A$ there exists a pairwise linearly independent
POVM fuzzy equivalent to $A$ and that such a POVM is unique up to strict isomorphism.
This assertion is a direct corollary of our Theorems 6 and 7.

V. INFORMATION CONSERVATION CONDITION 

In this brief section, we consider information conservation conditions proposed by the
author in Refs. 9 and 10.

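Let $(\Omega_1, \mathscr{B}_1)$ be a measurable space. A completely positive (CP) instrument $\mathcal{I}^1_{\cdot}(\cdot)$ (in
the Heisenberg picture) with the outcome space $(\Omega_1, \mathscr{B}_1)$ is a mapping

$$\mathcal{I}^1_{\cdot}(\cdot)\colon \mathscr{B}_1 \times \mathcal{L}(\mathcal{H}) \ni (E_1, a) \mapsto \mathcal{I}^1_{E_1}(a) \in \mathcal{L}(\mathcal{H})$$

such that

(i) for each countable disjoint $\{E_j\} \subset \mathscr{B}_1$ and each $\rho \in \mathcal{S}(\mathcal{H})$ and $a \in \mathcal{L}(\mathcal{H})$,
$\mathrm{tr}[\rho\, \mathcal{I}^1_{\cup_j E_j}(a)] = \sum_j \mathrm{tr}[\rho\, \mathcal{I}^1_{E_j}(a)]$;

(ii) $\mathcal{I}^1_{\Omega_1}(I) = I$;

(iii) $\mathcal{I}^1_{E}(\cdot)$ is a normal CP linear map for every $E \in \mathscr{B}_1$.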

A CP instrument simultaneously describes the probability distribution of the outcome of a 
quantum measurement process and the state change due to the measurement. 

Let $\mathcal{I}^1_{\cdot}(\cdot)$ be a CP instrument with a standard Borel outcome space $(\Omega_1, \mathscr{B}_1)$ and let
$(\Omega_2, \mathscr{B}_2, A^2)$ be a standard Borel POVM. The composition $\mathcal{I}^1 * A^2$ is the unique POVM with
the product outcome space $(\Omega_1 \times \Omega_2, \mathscr{B}_1 \times \mathscr{B}_2)$ such that

$$(\mathcal{I}^1 * A^2)(E_1 \times E_2) = \mathcal{I}^1_{E_1}(A^2(E_2))$$

for each $E_1 \in \mathscr{B}_1$ and $E_2 \in \mathscr{B}_2$. The composition corresponds to the joint successive
measurement process of $\mathcal{I}^1_{\cdot}(\cdot)$ followed by $A^2$.
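For discrete outcome spaces the composition can be written out explicitly. The following sketch rests on simplifying assumptions that are not in the text: each outcome $x$ of the instrument has a single real Kraus matrix $K_x$, so that in the Heisenberg picture $\mathcal{I}^1_{\{x\}}(a) = K_x^{\mathsf{T}} a K_x$, and all entries are exact rationals. The names `compose`, `kraus` and `povm2` and the numerical values are hypothetical.

```python
from fractions import Fraction as F
from functools import reduce

def matmul(a, b):
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(len(b)))
                       for j in range(len(b[0]))) for i in range(len(a)))

def transpose(a):
    return tuple(zip(*a))

def add(a, b):
    return tuple(tuple(p + q for p, q in zip(ra, rb)) for ra, rb in zip(a, b))

def compose(kraus, povm2):
    """(I^1 * A^2)(x, y) = K_x^T A^2(y) K_x, assuming one real Kraus matrix per outcome x."""
    return {(x, y): matmul(matmul(transpose(k), a2), k)
            for x, k in kraus.items() for y, a2 in povm2.items()}

# Assumed example: a non-projective two-outcome instrument on a qubit.
kraus = {
    0: ((F(3, 5), F(0)), (F(0), F(4, 5))),
    1: ((F(4, 5), F(0)), (F(0), F(3, 5))),
}
# Assumed second measurement: projections onto the two diagonal directions.
half = F(1, 2)
povm2 = {
    0: ((half, half), (half, half)),
    1: ((half, -half), (-half, half)),
}

joint = compose(kraus, povm2)
total = reduce(add, joint.values())
marginal_x0 = add(joint[(0, 0)], joint[(0, 1)])
```

Summing the joint elements over all pairs $(x, y)$ gives the identity, and summing over $y$ alone recovers $\mathcal{I}^1_{\{x\}}(I) = K_x^{\mathsf{T}} K_x$, the POVM element of the first measurement.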

For a given CP instrument $\mathcal{I}^1_{\cdot}(\cdot)$ with a standard Borel outcome space and a given
standard Borel POVM $A^2$, we consider the following two conditions.

1. There exists a sufficient statistic $x\colon \Omega_1 \times \Omega_2 \to \Omega_2$ such that $(\mathcal{I}^1 * A^2)_x = A^2$.

2. $\mathcal{I}^1 * A^2 \simeq_r A^2$.

In Ref. 9, the author derived condition 1 as a sufficient condition for a so-called
relative-entropy conservation law. In Ref. 10, the author reformulated this condition in the form of
condition 2 and called it an information conservation condition, although condition 1 is a
sufficient but not a necessary condition for condition 2. Noting that condition 2 is invariant
under replacing $A^2$ with another regularly fuzzy equivalent standard Borel POVM, this
discrepancy can be resolved by taking as $A^2$ an equivalent minimal sufficient POVM, which
is always possible due to Theorem 0, and in this sense the two conditions are essentially
equivalent.

Appendix A: Inequivalence of $\simeq_r$ and $\simeq_w$

In this appendix, we construct a pair of POVMs that are weakly fuzzy equivalent but 
not regularly fuzzy equivalent. 

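Let $(\Omega_A, \mathscr{B}_A, A)$ be a POVM. The completion of $(\Omega_A, \mathscr{B}_A, A)$ is a POVM $(\Omega_A, \overline{\mathscr{B}}_A, \bar{A})$
defined by

$$\mathcal{N}_A := \{ N \subset \Omega_A \mid \exists N' \in \mathscr{B}_A \text{ s.t. } N \subset N' \text{ and } A(N') = O \},$$

$$\overline{\mathscr{B}}_A := \{ E \subset \Omega_A \mid \exists F \in \mathscr{B}_A \text{ s.t. } E \triangle F \in \mathcal{N}_A \},$$

$$\bar{A}(E) := A(F), \quad (E \in \overline{\mathscr{B}}_A,\ F \in \mathscr{B}_A,\ E \triangle F \in \mathcal{N}_A).$$

Here $E \triangle F := (E \setminus F) \cup (F \setminus E)$ is the symmetric difference of sets. An element of $\mathcal{N}_A$ is
called an $A$-null set.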

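Lemma 3. Let $(\Omega_A, \mathscr{B}_A, A)$ be a POVM and let $(\Omega_A, \overline{\mathscr{B}}_A, \bar{A})$ be the completion of $(\Omega_A, \mathscr{B}_A, A)$.
Then $A \simeq_w \bar{A}$.

Proof. Since $A(E) = \bar{A}(E) = \int_{\Omega_A} \chi_E(x)\, d\bar{A}(x)$ for each $E \in \mathscr{B}_A$, $A \preceq_r \bar{A}$ holds. For each
$E \in \overline{\mathscr{B}}_A$ we take $F \in \mathscr{B}_A$ such that $E \triangle F \in \mathcal{N}_A$ and define $\kappa(E|x) := \chi_F(x)$. Then $\kappa(\cdot|\cdot)$
is an $A$-weak Markov kernel such that $\bar{A}(E) = \int_{\Omega_A} \kappa(E|x)\, dA(x)$ for each $E \in \overline{\mathscr{B}}_A$. Thus we
have $\bar{A} \preceq_w A$, and the assertion holds. □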

Let $\mu$ be the usual Lebesgue measure on the unit interval $I := [0,1]$, i.e. $\mu$ is the unique
measure defined on the $\sigma$-algebra $\mathscr{B}(I)$ of $I$ generated by open subsets of $I$ such that
$\mu([a,b]) = b - a$ for each $0 \leq a \leq b \leq 1$. As the system Hilbert space $\mathcal{H}$, we consider
the set of square-integrable $\mathscr{B}(I)$-measurable functions $L^2(I, \mathscr{B}(I), \mu)$ in which $\mu$-a.e. equal
functions are identified. We define a projection-valued measure (PVM) $(I, \mathscr{B}(I), A)$ by

$$(A(E)f)(x) := \chi_E(x) f(x)$$

for each $E \in \mathscr{B}(I)$ and $f \in \mathcal{H}$. We denote the completion of $(I, \mathscr{B}(I), A)$ by $(I, \overline{\mathscr{B}}(I), \bar{A})$.
Since the class of $A$-null sets and that of $\mu$-null sets coincide, $\overline{\mathscr{B}}(I)$ is the class of Lebesgue
measurable subsets of $I$. From Lemma 3 we have $A \simeq_w \bar{A}$.

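Now we prove that $\bar{A} \not\preceq_r A$, from which we immediately obtain the desired relation
$A \not\simeq_r \bar{A}$. Suppose that $\bar{A} \preceq_r A$ holds. Then there exists a regular Markov kernel
$\kappa(\cdot|\cdot)\colon \overline{\mathscr{B}}(I) \times I \to [0,1]$ such that

$$\bar{A}(E) = \int_I \kappa(E|x)\, dA(x)$$

for each $E \in \overline{\mathscr{B}}(I)$. Thus we have

$$\int_I \chi_E(x)\, dA(x) = A(E) = \bar{A}(E) = \int_I \kappa(E|x)\, dA(x) \tag{A1}$$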

for each $E \in \mathscr{B}(I)$. From Remark 5 of Ref. Q, Eq. (A1) implies that $\chi_E(x) = \kappa(E|x)$ for
$\mu$-a.e. $x \in I$. Therefore there exists a $\mu$-null set $N \in \mathscr{B}(I)$ such that

$$\kappa([0,r]|x) = \chi_{[0,r]}(x) \quad (\forall r \in I \cap \mathbb{Q},\ \forall x \in I \setminus N), \tag{A2}$$

where $\mathbb{Q}$ is the set of rational numbers. Noting that $\kappa(\cdot|x)$ is a probability measure for each
$x \in I$, Eq. (A2) indicates that $\kappa(E|x) = \chi_E(x)$ for each $x \in I \setminus N$ and $E \in \mathscr{B}(I)$. Thus we
have $\kappa(I \setminus \{x\}|x) = 0$ for each $x \in I \setminus N$ and this implies that $\kappa(E|x) = \chi_E(x)$ for each
$x \in I \setminus N$ and $E \in \overline{\mathscr{B}}(I)$. If there exists a set $E$ such that

$$E \subset I \setminus N, \quad E \in \overline{\mathscr{B}}(I) \setminus \mathscr{B}(I), \tag{A3}$$

then we have $\chi_E(x) = \kappa(E|x)\, \chi_{I \setminus N}(x)$, which contradicts the $\mathscr{B}(I)$-nonmeasurability of
$\chi_E(x)$.

Now we show the existence of $E$ satisfying (A3). Since $(I \setminus N, (I \setminus N) \cap \mathscr{B}(I))$ is a standard
Borel space and the restriction of $\mu$ to $I \setminus N$ is a continuous measure, from Theorem 17.41 of
Ref. 21, there exists an $(I \setminus N) \cap \mathscr{B}(I)/\mathscr{B}(I)$-bimeasurable bijection $f\colon I \setminus N \to I$ such that
$\mu(f(E)) = \mu(E)$ for every $E \in (I \setminus N) \cap \mathscr{B}(I)$. Since there exists a set $E' \in \overline{\mathscr{B}}(I) \setminus \mathscr{B}(I)$,
$E := f^{-1}(E')$ satisfies the condition (A3), which completes the proof of the assertion.